BreaKmer: detection of structural variation in targeted massively parallel sequencing data using kmers
نویسندگان
چکیده
Genomic structural variation (SV), a common hallmark of cancer, has important predictive and therapeutic implications. However, accurately detecting SV using high-throughput sequencing data remains challenging, especially for 'targeted' resequencing efforts. This is critically important in the clinical setting where targeted resequencing is frequently being applied to rapidly assess clinically actionable mutations in tumor biopsies in a cost-effective manner. We present BreaKmer, a novel approach that uses a 'kmer' strategy to assemble misaligned sequence reads for predicting insertions, deletions, inversions, tandem duplications and translocations at base-pair resolution in targeted resequencing data. Variants are predicted by realigning an assembled consensus sequence created from sequence reads that were abnormally aligned to the reference genome. Using targeted resequencing data from tumor specimens with orthogonally validated SV, non-tumor samples and whole-genome sequencing data, BreaKmer had a 97.4% overall sensitivity for known events and predicted 17 positively validated, novel variants. Relative to four publically available algorithms, BreaKmer detected SV with increased sensitivity and limited calls in non-tumor samples, key features for variant analysis of tumor specimens in both the clinical and research settings.
منابع مشابه
Detection of structural mosaicism from targeted and whole-genome sequencing data.
Structural mosaic abnormalities are large post-zygotic mutations present in a subset of cells and have been implicated in developmental disorders and cancer. Such mutations have been conventionally assessed in clinical diagnostics using cytogenetic or microarray testing. Modern disease studies rely heavily on exome sequencing, yet an adequate method for the detection of structural mosaicism usi...
متن کاملA Model-Based Clustering Method for Genomic Structural Variant Prediction and Genotyping Using Paired-End Sequencing Data
Structural variation (SV) has been reported to be associated with numerous diseases such as cancer. With the advent of next generation sequencing (NGS) technologies, various types of SV can be potentially identified. We propose a model based clustering approach utilizing a set of features defined for each type of SV events. Our method, termed SVMiner, not only provides a probability score for e...
متن کاملTIGRA: a targeted iterative graph routing assembler for breakpoint assembly.
Recent progress in next-generation sequencing has greatly facilitated our study of genomic structural variation. Unlike single nucleotide variants and small indels, many structural variants have not been completely characterized at nucleotide resolution. Deriving the complete sequences underlying such breakpoints is crucial for not only accurate discovery, but also for the functional characteri...
متن کاملNoninvasive Prenatal Diagnosis of Fetal Trisomy 21 by Allelic Ratio Analysis Using Targeted Massively Parallel Sequencing of Maternal Plasma DNA
BACKGROUND Plasma DNA obtained from a pregnant woman contains a mixture of maternal and fetal DNA. The fetal DNA proportion in maternal plasma is relatively consistent as determined using polymorphic genetic markers across different chromosomes in euploid pregnancies. For aneuploid pregnancies, the observed fetal DNA proportion measured using polymorphic genetic markers for the aneuploid chromo...
متن کاملNoninvasive prenatal diagnosis of duchenne muscular dystrophy: comprehensive genetic diagnosis in carrier, proband, and fetus.
BACKGROUND Noninvasive prenatal diagnosis of monogenic disorders using maternal plasma and targeted massively parallel sequencing is being investigated actively. We previously demonstrated that comprehensive genetic diagnosis of a Duchenne muscular dystrophy (DMD) patient is feasible using a single targeted sequencing platform. Here we demonstrate the applicability of this approach to carrier d...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 43 شماره
صفحات -
تاریخ انتشار 2015